Reinforcement learning

Results: 1147



#Item
891Stochastic control / Partially observable Markov decision process / Markov decision process / Reinforcement learning / Heuristic function / Mathematical optimization / Statistics / Dynamic programming / Markov processes

Optimally Solving Dec-POMDPs as Continuous-State MDPs Jilles Steeve Dibangoye Inria / Universit´e de Lorraine Nancy, France [removed]

Add to Reading List

Source URL: lis.csail.mit.edu

Language: English - Date: 2013-08-12 12:17:22
892Stochastic control / Partially observable Markov decision process / Markov decision process / Reinforcement learning / Danger Hiptop / Economic model / Borland Sidekick / Simulation / S0 / Statistics / Dynamic programming / Markov processes

POMCoP: Belief Space Planning for Sidekicks in Cooperative Games Owen Macindoe, Leslie Pack Kaelbling, and Tom´as Lozano-P´erez CSAIL, 32 Vassar Street Cambridge, Massachusetts, [removed]Abstract

Add to Reading List

Source URL: lis.csail.mit.edu

Language: English - Date: 2012-10-18 15:45:42
893Cybernetics / Dynamic programming / Stochastic control / Reinforcement learning / Computational neuroscience / Markov decision process / Machine learning / Intelligent agent / Reinforcement / Statistics / Artificial intelligence / Science

Journal of Machine Learning Research[removed]1371 Submitted 10/09; Revised 6/11; Published 5/12 Transfer in Reinforcement Learning via Shared Features George Konidaris

Add to Reading List

Source URL: lis.csail.mit.edu

Language: English - Date: 2012-06-08 19:50:16
894Reinforcement learning / Parametrization / Statistics / Estimation theory / Statistical theory / Coordinate systems / Dimensional analysis / Measurement

Transfer Learning by Discovering Latent Task Parametrizations George Konidaris MIT CSAIL Cambridge, MA[removed]removed]

Add to Reading List

Source URL: lis.csail.mit.edu

Language: English - Date: 2012-11-30 15:08:20
895Stochastic control / Partially observable Markov decision process / Markov decision process / Reinforcement learning / Multi-agent system / Agent-based model / Heuristic function / Affect / A* search algorithm / Statistics / Dynamic programming / Markov processes

Heuristic Search of Multiagent Influence Space Frans A. Oliehoek1 , Stefan Witwicki2 , and Leslie P. Kaelbling1 1 2

Add to Reading List

Source URL: people.csail.mit.edu

Language: English - Date: 2011-12-05 10:35:12
896Reinforcement learning / Machine learning / Support vector machine / Nonlinear dimensionality reduction / Linear regression / Statistics / Econometrics / Regression analysis

Learning Parameterized Skills Bruno Castro da Silva [removed] Autonomous Learning Laboratory, Computer Science Dept., University of Massachusetts Amherst, 01003 USA. George Konidaris

Add to Reading List

Source URL: lis.csail.mit.edu

Language: English - Date: 2012-08-29 16:33:32
897Fourier analysis / Numerical analysis / Linear algebra / Joseph Fourier / Spectral theory / Fourier series / Proto-value functions / Gibbs phenomenon / Basis function / Mathematical analysis / Mathematics / Algebra

Value Function Approximation in Reinforcement Learning using the Fourier Basis George Konidaris1,3 1 MIT CSAIL [removed]

Add to Reading List

Source URL: lis.csail.mit.edu

Language: English - Date: 2012-06-08 19:46:17
898Stochastic control / Markov models / Partially observable Markov decision process / Reinforcement learning / Markov decision process / Feature selection / Statistics / Dynamic programming / Markov processes

LNAI[removed]Sequential Feature Selection for Classification

Add to Reading List

Source URL: www.rueckstiess.net

Language: English - Date: 2011-12-08 22:18:53
899Reinforcement learning / CMA-ES / Dimensional analysis / Applied mathematics / Science / Estimation theory / Statistics / Measurement

Policy Gradients with Parameter-based Exploration for Control Frank Sehnke1 , Christian Osendorfer1 , Thomas R¨ uckstieß1 , 1 3

Add to Reading List

Source URL: www.rueckstiess.net

Language: English - Date: 2009-11-26 21:08:11
900Oilfield terminology / Petroleum engineering / Viscosity / Signal-to-noise ratio / Navier–Stokes equations / Noise / Normal distribution / Reinforcement learning / Statistics / Measurement / Engineering

Motor Learning at Intermediate Reynolds Number: Experiments with Policy Gradient on a Heaving Plate John W. Roberts, Jun Zhang and Russ Tedrake Abstract This work describes the development of a model-free reinforcement

Add to Reading List

Source URL: groups.csail.mit.edu

Language: English - Date: 2009-08-27 12:26:43
UPDATE